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Abstract. An approximation to the solution of a stochastic parabolic equation is con- 
structed using the Galerkin approximation followed by the Wiener Chaos decomposition. 
The result is applied to the nonlinear filtering problem for the time homogeneous diffusion 
model with correlated noise. An algorithm is proposed for computing recursive approxima- 
tions of the unnormalized filtering density and filter, and the errors of the approximations 
are estimated. Unlike most existing algorithms for nonlinear filtering, the real-time part of 
the algorithm does not require solving partial differential equations or evaluating integrals. 
The algorithm can be used for both continuous and discrete time observations. 
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1. Introduction 

Let (fi, J 7 , P) be a probability space with a Wiener process W = W(t). Consider a random 
field u = u{t,x), < t < T, x e R d so that u e L 2 (Q; C([0, T); L 2 (R d ))) and u(T,-) is 
measurable with respect to the sigma-algebra Fjf generated by the Wiener process up to 
time T. If {e^, k > 1} is an orthonormal basis in L2(M. d ) and {£ m , m > 1} is an orthonormal 
basis in L 2 (fl, J 7 ^), then we can write 

k / M 



k=l \m=l 



where ipk m {T) are some deterministic coefficients. The objective of the current work is to 
study approximation ( |1 . 1| ) when the random field u is a solution of a stochastic parabolic 
equation. For such random fields it is possible to derive explicit representation for the 
coefficients (fkm and get an upper bound on the approximation error in ( |1 . 1| ) . The results 



are then used to derive an approximate algorithm for solving the nonlinear filtering problem 
of diffusion processes with correlated noise. 

The problem of nonlinear filtering can be briefly described as follows. Assume that (X = 
X(t),Y — Y(t)), t > 0, are two diffusion processes with values in M d and M. r respectively, so 
that X is the unobservable component and the observable component Y is given by 

Y(t) = [ h(X(s))ds + W(t). 
Jo 

The problem is called noise-uncorrelated, if the Wiener process W = W(t), representing the 
observation noise, is independent of X. The problem is called noise-correlated, if there is 
correlation between W and X. If / = f(x) is a measurable function satisfying E|/(X(t))| 2 < 
oo, t > 0, then the problem of nonlinear filtering is to find the best mean square estimate 
ft of f(X(t)) given the trajectory Y(s), s < t. It is known [11], [L4], |22| that, under certain 
regularity assumptions, we have 

/ x 2 n j t = Ldf(x)p(t,x)dx 

1 J Rd p(t,x)dx 

where p = p(t,x) is a random field called the unnormalized filtering density (UFD). The 
problem of estimating f(X(t)) is thus reduced to the problem of computing the UFD p. It 
is also known [^!| that p = p(t, x) is the solution of the Zakai filtering equation, a stochastic 
parabolic equation, driven by the observation process. The exact solution of this equation 
can be found only in some special cases, and the development of numerical schemes for 
solving the Zakai equation has become an area of active research. 

Many of the existing numerical schemes for the Zakai equation use various generalizations of 
the corresponding algorithms for the deterministic partial differential equations. Examples 
of the corresponding algorithms can be found in Bennaton [jlj, Florchinger and LeGland 
||, Ito |10[ , etc. Because of the large amount of calculations, these algorithms cannot be 
implemented in real time when the dimension of the state process is more than three. 

In some applications, like target tracking, the filter estimate must be computed in real 
time. Such applications require filtering algorithms with fast on line computations. When 
the parameters of the model are known in advance, the real time computations can be 
simplified by separating the deterministic and stochastic components of the Zakai equation 
and performing the computations related to the deterministic component in advance. The 
separation is based on the Wiener chaos decomposition of solutions of stochastic parabolic 
equations. Starting with the works of Kunita |TB|, Ocone PH , and Lo and Ng [TjJ, this 



approach was further developed by Budhiraja and Kallianpur [0, 13, 0J and Mikulevicius and 



Rozovskii |17|, [18], [L9| ^C|. An algorithm to solve the Zakai equation using this approach 



for the noise uncorrelated problem was suggested in Lototsky et al. |16fl . The algorithm in 
ITBll was based on the following approximation. First, the unnormalized filtering density was 
approximated using the Wiener Chaos decomposition: 

M 

(1.3) p{t, x) W ^ fm(t, x)£ m . 

m=l 
2 



After that, the coefficients ip m were expanded in the basis {e^} in L 2 (M> d ), resulting in the 
approximation 

m f K \ 

(1.4) p(t, x) w I J] V 3 m,jfc(t)e fc (a;) I f m . 

m=l \fc=l / 

In other words, first, the stochastic variable was separated, and then, the spacial variable. 
Alternatively, one can start with the Galerkin approximation of p: 

K 

(1.5) p(*,aO«XX(*)e*(a:). 

k=l 

The coefficients p^(t) satisfy a system of stochastic ordinary differential equations driven 
by the observation process. The solution of this system is then expanded using the Wiener 
Chaos decomposition, resulting in the approximation of the type (|1.1|): 

K f M \ 

(1.6) p(t, x) « J2 E <k(t)U e k (x). 

k=l \m=l / 

In other words, first, the spacial variable is separated, then, the stochastic variable. The 
optimal filter is then approximated by 



;i-7) /* 



J2k=i (E m=l 

Ef=l (Em=l ^m,ifc(0Cm) "fe 



where fk = J Rd f(x)ek(x)dx, oik = f Rd ek(x)dx. First suggested in || as a computational 
alternative to (|1.4j ), approximation ( |1.6|) was further analysed in [|J. 

The order in which the variables are separated does make a difference. The algorithms 
based on approximations (|1.4| ) and Q1.6Q have different approximation errors and, unlike 



~4[) , analysis of (|1.6|) is possible for noise correlated problem. 
Recall that the Zakai filtering equation for the unnormalized filtering density p = p(t, x) is 
(1.8) dp = C*pdt + M*pdY(t). 

The elliptic differential operator £ is the generator of the unobserved process X, while 
the operator M. is bounded in the noise uncorrelated problem and in unbounded in the 
noise correlated problem. The presence of the unbounded operator in the stochastic part of 
equation ( |1.8| ) for the noise correlated problem makes the analysis and implementation of the 
numerical methods for the Zakai equation much more difficult (see, for example, Florchinger 
and LeGland ||). 

The objective of the current work is to analyze the algorithm for solving the Zakai equation 
using approximation (|1.6|). First, (|OJ) is studied for an abstract stochastic evolution system. 
In Section 2, the Galerkin approximation is investigated, and in Section 3, the Wiener chaos 
decomposition for a system of stochastic ordinary differential equations. In each situation, 
the rate of convergence is established in terms of the numbers K and M of the basis function 
used. The filtering problem is introduced in Section 4, the filtering algorithm is presented in 
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Section 5, and the convergence of the algorithm is studied in Section 6. The real time part 
of the proposed algorithm does not require solving differential equations or using quadrature 
methods to evaluate integrals in (|1.2j ). The algorithm can also be used if the observations 
are available in discrete time. 

Unlike the previous works on the subject, this paper presents a unified treatment of both 
noise-correlated and noise-uncorrelated problems with possibly degenerate diffusion in the 
un-observed component. Another difference from the previous works on the subject is that 
the error bound is derived not only for the filtering density but also for the optimal filter f t 
with a large class of functions /. 

2. Galerkin approximation of stochastic evolution equations 
Consider the stochastic evolution system 



where W = W(t) is an r - dimensional standard Wiener process on a complete probability 
space (fi,^ 7 , P), m is independent of W, and A and Bi, I — 1, ... ,r, are linear operators 
acting in the scale of infinite dimensional Hilbert spaces {H a , a G M}. To simplify the 
notation, both the inner product in H° and the duality between H 1 and H _1 will be denoted 
by (•, -) ; || • || is the norm in the space H a . It will be assumed that equation ( |2.1|) is either 
coercive or dissipative |22[ Chapter 3]. In particular, there exists a constant C* > so that, 
for every v G H 1 , 



(2.2) \\Av\\-i < C*\\v\\i, \\Biv\\ < C*\\v\\i, &ad2(Av,v) + ^2\\B l v\\l<C*\\v 



If Uq G H 1 , then there is a unique solution u = u(t) in the space L 2 (fl x [T^T];!! 1 ) PI 
L 2 {Q; C([T ,T];H )) (see Theorems 3.1.4 and 3.2.2 in 0). 

Suppose there exists an orthonormal basis {e k , k >} in H° so that G H 1 for all k. Consider 
the following system of stochastic ordinary differential equations: 



(2.1) 




r 



1=1 



K 



)XAe n , e k ) u^(t)dt 



(2.3) 



n=l 
r K 



+ 




1=1 n=l 



(u ,e k ) , k 



0,...,K. 



The function 



A' 




k=l 

4 



is called the Galerkin approximation of u(t). It is proved in the following theorem that, 
under some natural assumptions, 

lim sup E||u(£; T ; u ) — u K (t) = 0, 

K ^°°T <t<T 

and the rate of convergence is determined. 

2.1. Theorem. Let the following conditions be fulfilled: 

1. The basis {e k } consists of the eigenfunctions of a linear operator A with the corre- 
sponding eigenvalues \ k . The operator A is a symmetric operator in T~C° and there 
exist numbers < c\ < C2 and 9 > so that, for all k, 

(2.4) ci < X k k- d < c 2 ; 

2. e k e H 1 and \\e k \\i < C e k q , q > 0; 

3. sup T()<t<T IE 1 1 A u u{t) ||q < oo for some positive integer v so that Q\ := v6 — 2q> 1. 
Then 



(2.5) sup E\\u(t)-u K (t)\\ 2 < sup E\\A v u(t)\\ 2 , 

Tr,<t<T Tr,<t<T K^" 1 L > 



where C is a constant depending only on the constant C* in ( \2.8j ) and the numbers 
C!,c 2 ,C e , v, 9, q. 

Proof. liip k {t) ■= (u(t),e k ) , then 



K 



(2.6) E\\ u (t) - u K (t)\\i = j2nMt) - 4{t)\ 2 + nMt)\ 2 - 

k=0 k>K 

By assumptions 1 and 3 of the theorem, 

(2.7) |« t )| < 
so that 

(2.8) sup ^E|^(t)| 2 < sup E||AX<) 112^=1 < sup E||A^(t)||2_^- 

To<t<Tf~t T <t<T & lVt> 1 T <t<T V) 



k>K 



For 1 < k < K define 5 k (t) := tj) k {t) -< (t), so that £f =1 E|V>*(t) - wf (t)| 2 = Ef=i E l4| 
and also define 

: = y^(-4e fc ,e n )o^ fc (^), 4,nW := y^(^e fc ,e n )o^ fc (t). 



Both Sx tn (t) and 5 l 2 n (t) are well defined due to ( |2.7[ ) and assumptions 2 and 3 of the theorem. 
Then 



K r K 



d $n{t) = ^(Ae k , e n ) Q 8 k (t)dt + 5Z(£«e fc , e n ) 4(t)dWi(t) 

k=l 1=1 k=l 

(2.9) _I_ 

+ 6 ltn {t)dt + ^ T <t<T; 

K(T ) = 0, l<ra<#, 
and by the Ito formula, 

K t K 

^E|<5„,(t)| 2 = 2 {Ae k ,e n ) E5 n {s)6 k {s)ds 

n=l n,fc=l 

r K t I K \ 2 JC t 

(2.10) E E^OoW + E5 1;n (s)5 n (s)ds 

1=1 n=l J T ° \k=l J n=l 

+2 EE / (Bie k ,e n ) E5 l 2)n (s)5 k (s)ds + Y,J2 H^ n (s)fds. 

1=1 n ,k=l Jt ° 1=1 n=l jT ° 



It follows from the third inequality in ( [2.2|) that 

ft K 

2 ^ (>4.e fc , e n )oE8 n (s)8 k (s)ds 

n /,■ I 

^ / r K i / K \ * K 

+ ^[J2(13ie kl e n ) 8 k (s)\ ds < cf^ [' Hh(s)) 2 ds. 

l=ln=l^ T \k=l J k=l^ T ° 

The first two inequalities in ( [2.2|) and assumption 2 imply 

|(^4e fe ,e n )o| < C||e fc ||i||e n ||i < Ck g n g , \ (Bie k , e n ) \ < C||e fe ||i||e n || < Ck 9 , 
so that by (g/7p, 

\8i,n[t)\ < Ctf Rve _ q _ x , \5 2 J < C Rve _ q _ x , 

and 

<r E(5i, n (s)Yds+j2J2 1 n^ n {s)fds 

(2.12) n=l jT ° l=ln=l jT ° 

<(T-T ) sup E\\A"u{t)\\lj^S- 

T <t<T ' 

After that ( |2.1(J| )-( |2~T^ ) and the obvious inequality 2\ab\ < a 2 + b 2 imply 

J2m(t)\ 2 <cf2 fmis^ds+iT-To) S u P eiia^is^^, 

n=l n=l ^ T ?b<t<T -^ 



so that by the Gronwall inequality 

K 



sup £E|5„(t)| 2 <(T-T ) sup E||A^(t)|| 2 e^)^±i)£. 

T <t<T , T <t<T A^^ 1 L > 

n=l 



Together with ( |2.6| ) and ( |2.8| ), the last inequality implies ( |2.5| ). Theorem |2.1| is proved. □ 



3. Wiener Chaos Expansion 

On a complete probability space (Q, JF, P) consider a system of stochastic ordinary differential 
equations: 

(3.1) U{t) = U + I AU(s)ds + ! ^B l U{s)dW l {s) T > 0, 

JTq JTo 

where U(t),Uo G M^, A, I?; e 1^*^, the matrices A, I?; are deterministic, and Uq is in- 
dependent of the r-dimensional Wiener process W. The solution of Q3.1J ) is denoted by 
U(t;T ;U ). 

In what follows, the Wiener chaos decomposition of U(t;T ;Uo) will be derived and the 
properties of the decomposition studied. 

As the first step, recall the construction of an orthonormal basis in the space L2^fl, J 7 ^ t , P) 
of square integrable random variables that are measurable with respect to the a-algebra, 
generated by the Wiener process up to time t. Let a be an r-dimensional multi-index, 
that is, a collection a = (a4)i<z<r, fe>i of nonnegative integers such that only finitely many 
of a l k are different from zero. The set of all such multi-indices will be denoted by J. For 
a E J define a\ := rLz^LO- 

For a fixed t* > To choose a complete orthonormal system {m^} = {rrik{s)}k>i in L2QT0, £*]) 
and define 

£k,i = / m k (s)dWi(s) 

so that C.h i are independent Gaussian random variables with zero mean and unit variance. 
If 

d n 

(3.2) H{x) := (_i)« e - 2 /2 « -x 2 /2 

\AjJu 

is the n-th Hermite polynomial, then the collection 



k,l \ V "fe- 

is an orthonormal system in ^(O, t », P). A theorem of Cameron and Martin || shows 
that {^(Wj],^)}^; is actually a basis in that space. 
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3.1. Theorem. J/77 G L 2 (0,^f t ,,P), then 

(3-3) 77 = ^ E [^(^T ^)]^(^To^) 

aeJ 

and 

Proof. This theorem is proved in |5| and M. □ 



3.2. Theorem. If t* > T fixed, then, for every s G [T ,i*] ; ine solution U(s;T ;Uo) 
can be written as 

(3.4) U(s;T ;U ) =^-^=( Pa (s ] T ',U )Z a {W>i h ,t-), 



, v Oil 



and t/ie following Parseval's equality holds: 

(3.5) E|C/( S ; T ; C/ )| 2 = V ^r£b«(s; T ; [7 )| 2 . 



The coefficients of the expansion are R K -vector functions and satisfy the recursive system of 
deterministic equations 

dtp a (s,T , U ) = ^ a ( s; r ;£7 ) + \^t4m fc (s)S I ^ a ( fci j)(s;To;C/'o), T o < s < t*; 
(3.6) ds Tt 

Pa(To;T ;U Q ) = ?7ol{|a|=0}, 
where a = (a4)i<z<r, ^>i G J and a(i,j) stands for the multi-index a = (a4)i<z<r, fc>i 

a4 if 7^ i or Z 7^ j or both 

max(0, a{ — 1) if k = i and / = j. 



(3.7) a 



k 



Proof. Assume first that Uq = g is deterministic; the Markov property of the solution of 
( |3.1|) implies that, once the derivation is complete, we can replace g with [To- 
ff g is deterministic, then U(s;T ;g) G L 2 (f2, .Fjf t , , P) for s < t*, and Theorem |3.1| implies 



UD and (U). 
To prove that the coefficients satisfy (|3.6[), define 

P t (z)=exp{ f Y^m^dW^s) - \ fY^\m l z {s)\ 2 ds\, T <t<t*, 
Jt l=1 Jt l=1 

where m l z = J2k>i m k(. s ) z l an d {<4} ; I = 1, ■ ■ ■ ,r, k = 1, 2, . . . , is a sequence of real numbers 
such that Yliki I -4 1 2 < 00 • Then direct computations show that 



2=0 



where 



and also, that 



d a -j-r d< 
11 



dz 



2 = 



for every r] G L 2 (^, P). Consequently. 
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<p a (s;T ;g) = —E[U(s;T ;g)P t ,(z)} 



d 



dz 



2=0 



—E[U(s;T ;g)P s (z)] 



2=0 



where the second equality follows from the martingale property of P s (z) on 
(Q, {^Fjh t }T <t<t* ^) . It follows from the definition of P s (z) that 

r 

dP s {z) = ^m l z {s)P s {z)dW l {s), T <s<t; P To {z) = 1. 
i=i 

Then (|3.1|) and the Ito formula imply that 

U(s;T ;g)P s (z) =g 

+ j (AU(T ] T 0] g) + J2BiU(r ] T 0] g)]m l z (r)P T (z)dr 



i=i 



/r 
{BiU(t; T ; g) + U(r; T ; g)m l z {s)) P s {z)dW l {r) . 

To 1=1 

Taking the expectation on both sides of the last equality and setting 
ip(s, Z] T ; g) := WJ (s; T ; g)P s (z) results in 

<p(s, z; T ; g) = g + / ( A(p(r, z; T ; g) + y^m^B^T, z \ T o] g) )dr. 

1 d a 

Applying the operator —j=-^-^ and setting z = yields that the functions <f a (s; T ; g) satisfy 



dz" 



□ 



( p.6|) . Theorem |3.2| is proved. 
For a multi-index a G J define 

• I « I : = J2i, k a k (length of a); 

• d(a) := max{A; > 1 : a l k > for some 1 < I < r} (order of a). 

To study the rate of convergence of the series in ( [3.4D , it is necessary to note that the 
summation J2 a &j * s double infinite: 



(3.8) 



«gj k=0 \a\=k 
9 



and there are infinitely many multi- indices a with \a\ = k > 0. 
Define = {a G J : \a\ < N, d(a) < n} and then 

(3.9) UUs; T ; U ) = ^ T ° ; U o)UW To ,t* 



J a' 



Now the summation in fl3.9|) is over a finite set: if d(a) < n, then there are at most (nr) k 
multi-indices a with |a| = k. 

3.3. Theorem. Let the constants C , Ci,C 2 fre such that \Av\ 2 < C \v \ 2 , \e At v\ 2 < e Clt \v\ 2 , 
\Biv\ 2 < C2 |f | 2 for every vector v G M. K . If the basis {ink} is the Fourier cosine basis 



(3-10) m 1 ( S ) = - ? =L=; m k (s) = ^ F ^ ¥Q Cos( n{k T ° } ) , k > 1; T < s < £*, 



E|[/(t* ; T ; U ) - U n N {t*; T ; U Q )\ 2 < 2e^- T °> 



( [C 2 r(t*-T )] N+1 
{ {N + l)\ 

+2C 2 r ( - t *~ To) \ e(B) + C (l + C 2 r(i* - T ))(t* - T )] ) E|C/ 1 2 , 

where C = C\ + C 2 r and < e(B) < 4; e(B) = if the matrices B\ commute (in "particular, 
ifr= I). 

This Theorem is proved below in Section 7. 
If t* - T = A, then ( pTT]) becomes 



f(CA) N+1 A 2 



(3.12) E|C/(f ; T ; C/ ) - U%(t*; T ; C/ ) | 2 < c P • ' + — (c(S) + CA)J E|[/ | 2 , 
and the constant C depends only on the matrices A and £>; in (|3.1|) . 

4. Diffusion Filtering Model 

Let (fl, T, P) be a complete probability space with independent standard Wiener processes 
W = W(t) and V = V(t) of dimensions d\ and r respectively. Let X be a random variable 
independent of W and V. In the diffusion filtering model, the unobserved <i - dimensional 
state (or signal) process X = X(t) and the r-dimensional observation process Y = Y(t) are 
defined by the stochastic ordinary differential equations 

dX(t) = b(X(t))dt + a(X(t))dW(t) + p(X(t))dV(t), 
(4.1) dY{t) = h(X(t))dt + dV{t), < t < T; 

X(0)=X , F(0) = 0, 



where 6(ar) G M d , a{x) G M dxdl , G R dxr , h{x) G M r . 
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Assumption Rl. The functions a and p are Cj^IR^), that is, bounded and three times 
continuously different iable on M. d so that all the derivatives are also bounded; the functions 
b and h are Cj;(lR), and the random variable X has a density p . 



Under Assumption Rl system (|4.1| ) has a unique strong solution [0, Theorems 5.2.5 and 
5.2.9]. 

If / = f(x) is a scalar measurable function on M, d so that sup 0<i<T E|/(X(t))| 2 < oo, then the 

filtering problem for ( |4.1| ) is to find the best mean square estimate f t of f(X(t)), t < T, given 
the observations Y(s), < s < t. Denote by !Fj the cr-algebra generated by Y(s), < s < t. 
Then the properties of the conditional expectation imply that the solution of the filtering 
problem is 

; t = E(/(x(t))|jf). 

To derive an alternative representation of ft, some additional constructions will be necessary. 
Define a new probability measure P on (O, T) as follows: for A £ T, 

Z^dF, 

J A 

where 

-t 

12, 



Z t = ex P y\*(X(s))dY(s) -1 jf* 



|/i(X(s))| 2 rfs 



(here and below, if C £ then ^ is a column vector, = . . . , ^ fc ), and |C| 2 = C*0- If 
the function h is bounded, then the measures P and P are equivalent. The expectation with 
respect to the measure P will be denoted by E. 



The following properties of the measure P are well known [11, 22 



PI. Under the measure P, the distributions of the Wiener process W and the random 
variable X are unchanged, the observation process Y is a standard Wiener process, 
and the state process X satisfies 

dX(t) = b(X(t))dt + a(X(t))dW(t) + p(X(t)) (dY(t) - h(X(t))dt) , < t < T; 
X(0) = X ; 

P2. Under the measure P, the Wiener processes W and Y and the random variable X 

are independent of one another; 
P3. The optimal filter f t satisfies 

(4 ' 2) h = Vim ■ 

Because of property P2 of the measure P the filtering problem will be studied on the prob- 
ability space (fl, J 7 , P). If the function h is bounded, then there is a continuous embedding 



(4.3) 



L 2 (n,F) c Li(fi,P). 
li 



Indeed, if f e L 2 (fi,P), then 



E£ = E(Z T £) < y EZf,y E£ 2 < CyE£ 2 , 

because 

EZ 2 = E(exp|/ T |/i(X(t))| 2 rft}exp|2/ T /i*(X(t))rfF(t) -2/ T |/i(X(t))| 2 ^ 
< CEexp{2/ T /i*(X(t))rfr(t) -2/ r |/i(X(t))| 2 rft| < C 



where the last inequality follows from the property P2 of P and Proposition 3.5.12 in 
Next, consider the partial differential operators 

d r,o , . d 



i,j=l 1 3 i=l 1 



Mig(x) = hi(x)g(x) + ^2 Pu( x )-^^- ^ l = l,...,r; 



and their adjoints 



1 " d 2 d d 

i,j=l % 3 i=l * 

d d 

M*g(x) = hi(x)g(x) - (pu(x)g(x)) , l = l,...,r. 



i=l 



dxi 



Let H a be the Sobolev space {/ : (1 + H 2 ) a/2 f e L 2 (R d )}, where / = f(w) is the Fourier 
transform of /; H° = /^(R 01 ) with the norm || • ||o- The inner product in L2(M. d ) and the 
duality between H 1 and H -1 will be denoted by (•, -)o- Note that the operators C, C* are 
bounded from H 1 to H -1 , operators A4, M* are bounded from H 1 to L 2 (M. d ), and, for every 

r 

(4-4) 2(£*g,g) + £ \\M* l9 \\ 2 Q < C\\g\\l. 



The following result is well known p2| Theorem 6.2.1]. 



4.1. Proposition. In addition to Assumption Rl suppose that the initial density po 
belongs to the space H 1 . Then there is a random field p = p(t,x), t e [0,T], x G M , with 
the following properties: 

1. p e L 2 (Q x (0, T), df x dt- H 1 ) n L 2 (n, P; C([0, T], L 2 (M d ))). 
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2. The function p(t, x) is a generalized solution of the stochastic partial differential equation 



dp(t,x) = C*p{t,x)dt + ^2M*p{t,x)dYi(t), <t <T, x eR d ; 
^ ' ' 1=1 
p(0,x) =p Q (x). 

3. The equality 

(4.6) I [f{X{t))Z t \rJ] = [ f{x)p{t,x)dx 

Jm d 

holds for all bounded measurable functions f . 

The random field p = p(t, x) is called the unnormalized filtering density (UFD) and the ran- 
dom variable 4>t\f] = E [f(X(t))Z t \J-'^] , the unnormalized optimal filter. Under Assumption 
Rl, equation ( f4.5|) is at least dissipative. If the matrix a a* is uniformly positive definite, 
then equation ( |4.5| ) is coercive rather than dissipative, and it is enough to assume that 
po e L 2 (R d ). 



5. Approximation of the optimal filter 

Let {ej, i > 1} be an orthonormal basis in L 2 (IR d ) so that every belongs to H 1 . Fix a 
positive integer number K. Define the matrices A K = (Afj, i,j = 1, . . . , K) and Bf = 
(fl*, /../ 1 A': / = l,...,r),by 

Since G H 1 for all i, the matrices are well defined. The Galerkin approximation p K (t,x) 
of p(t, x) is given by 

K 

(5.1) p K (t,x) = J2pf(t)e t (x), 

i=l 

where the vector p K {t) = {pf(t), i = 1, . . . , K} is the solution of the system of stochastic 
ordinary differential equations 

r 

(5.2) dp K {t) = A K p K {t)dt + Bj < p K {t)dY l {t) 

i=i 

with the initial condition pf (0) = (p , ej) . Note that the matrices Bf, I = 1, . . . , r, do not, 
in general, commute with each other even if p(x) = 0. 

We next use Theorem to derive the Cameron-Martin version of the Wiener chaos expan- 
sion of the solution of ( |5~2"| ) . 

Let = to < tx . . . < tu — T be a uniform (for simplicity) partition of the interval [0, T] with 
step A and let {rrik{t), k > 1} be an orthonormal basis in L 2 ([0, A]). Denote by J the set of 
all multi-indices a = {a l k , I = 1, . . . , r, k > 1, a l k = 0, 1, 2, . . .} so that \a\ = J2 t k a l k < oo. 
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Define random variables 

(5.3) C k ,i= r 



and then, for a £ J, 

(5.4) & = -nI[ H ci& 



k,l)i 

a] 17 



Jn 

where HJt) = {-l) n e* 72^L e -* : l /\ 
w v ' dt n 

The following result is a direct consequence of Theorem R72 



5.1. Theorem. For every i = 1, . . . , M , the solution of ( \5.2{ ) can be written in L 2 (fl; M. K ) 



as 



(5.5) p K {U) = Y J ^=^{^P K {U-i))C a , i = l,...,M, 



, v Oil 



where, for s G (0, A] and ( G M^, t/ie functions (p^ (s; C) ore the solutions of 
(5.6) ds 

rf(0;C) =Ci{|«i=o}, 

and cx(i,j) stands for the multi-index a = (54)i<z<r, k>\ with 



A K ^(s;C) + X>L™fc( s ) S W( M) (s;C), < s < A, 



(5.7) aj 



a[ if 7^ z or I ^ j or both 

max(0, cKj — 1) if = i and / = j. 



For fixed positive integers N and n define the set as the collection of multi-indices a from 
J such that | a | < iV and a l k — if k > n. The approximation p^' n {ti) of p K {ti) is defined 
by 



(5.8) p K N n {t ) = p K (0), p K N n {U) = £ -^=^(A;p^(t^))i 



„. i = l,...,M. 



Note the p N ,n (ti) is a vector in M. K . Let U = {u 3 , j = 1, . . . , K} be a basis in M. K . The 
vector p^f' n {ti) can then be written as 

K 

3=1 
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and by the recursive definition of p^' n (tj,, 

= E ^( A ;^' B (*i))f a 

K 

aGJ™ i=l 

Once again, ^ (A, u *) i s a vector in R , so we write 

^(A,u^)=f>f fc a (U)u fc , 
k=i 



K 



and conclude that 

(5-9) p^(t i+1 ; U) = £ £ <lfk a (V) P ^; U)C- 

aG j™ fe=i 

Then 

(5.10) pf n (t i} x) = p^J(*i+i; u)^^^) 

is an approximation of the unnormalized filtering density. 

Suppose that the basis functions e k and the function / are such that 



(5.11) fk= f(x)e k (x)dx 

is defined for every k — 1, . . . , K. It follows from ( |5.10|) that 

K 

(5-12) Uf]= J^PN^^vHh 

j,k=i 

is an approximation of the unnormlized optimal filter. 

The following is a possible algorithm for computing approximations of the unnormlized 
filtering density and optimal filter using ( |5.10[ ) and ( |5.12| ). 

1. Preliminary computations (before the observations are available): 

(1) Choose suitable basis functions {e k , k = 1, . . . , K} in L 2 (M. d ), {rrii, i — 1, . . . , n} in 
Z/2([0,A]) ; and a standard unit basis {uP , j = 1, . . . K} in M. K , that is, u* = 1, 

= otherwise. 

(2) for a G and j, k = 1, . . . , K compute 

Qjk a = ( Pa,j( A i uk ) (using (|]|)), f k = f{x)e k (x)dx, p^%{t ) = / p {x)e k (x)dx; 
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2. Real — time computations, i — th step (as the observations become available): compute 



C a , «e JJ (according to ffiDJ and 

Q? k (€) = £ 



K 



k=l 



(5-13) Pn^U) = Y,Q%{C)Pn%(U-i), J = l,...,K; 

then, if necessary, compute 

(5.14) pf n (ti,x) = £^;J(t i )e i (x), 



K 



A 



(5-15) 
and 

(5.16) A = fiM. 



5.2. Remark. The main advantage of the above algorithm as compared to most other 
schemes for solving the Zakai equation is that the time consuming computations, including 
solving partial differential equations and computing integrals, are performed in advance, 
while the real-time part is relatively simple even when the dimension d of the state process 
is large. Here are some other features of the algorithm: 

(1) The overall amount of preliminary computations does not depend on the number of 
the on-line time steps; 



(2) Formulas ( 5.15 ) and ( 5.16 ) can be used to compute an approximation to f ti , for ex- 
ample, conditional moments, without the time consuming computations of p^ n (ti,x) 
and the related integrals; 

(3) Only the coefficients must be computed at every time step while the approx- 
imate filter f t . and/or UFD p^' n (ti,x) can be computed as needed, for example, at 
the final time moment. 

(4) The real-time part of the algorithm can be easily parallelized. 

(5) Even though the coefficients q^ a are computed according to ( |5.6|) , their values can 
be further adjusted by simulating the state and observation processes and computing 
the corresponding filter estimates. 
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(6) If n = 1, then each depends only on the increments Yi(ti) — of the obser- 

vation process. For n > 1 and k > 1, the integral / — tj_x)rfVJ(s) can be 

Jti-i 

reduced to a usual Riemann integral and then approximated by the trapezoidal rule. 



6. Rate of convergence 

To study the convergence of the algorithm, it is necessary to specify the bases {e^, k > 1} 
on R d and {m h i > 1} on [0, A]. 

Let {ck-,k > 1} be the Hermite basis in L 2 (lR d ). The basis can be described as follows. 
Denote by T the set of ordered d-tuples 7 = (71, ... , 7^ with jj = 0,1,2,.... For 7 G T 
define 

d 

i=i 

where 

^ , / n ( — l) n +2 d n ,2 
H k (t) = f e ^e - * 

With this definition, 7Y 7 is the eigenfunction of the self-adjoint operator A = — V 2 + (l + \x\ 2 ): 
where V 2 is the Laplace operator and A 7 = (2|7| + d + 1). 

To define an ordering of the set T, we define | — y | = Yl d j=\lj an d then say that 7 < r if 
I7I < |r| or if |7| = \t\ and 7 < r under the lexicographic ordering, that is, 7j < r io , where 
i is the first index for which 7, 7^ T{. The basis {tk\k>\ is then the set {7^ 7 (x), 7 G T} 
together with the above ordering of the set T so that Aek = Xk&k and x k l / d . 

Next, we define an orthonormal basis {mj in L 2 ([0, A]) by 

1 pl f n(k — l)s\ 
mi(s) = mfc(s) = \ -t- cos ( a ) ? ^ > 1; < s < A. 



A' V A V A 



6.1. Definition. The filtering model (|4.1|) is called ^-regular for some positive integer 
v if the functions a and p belong to C 2z/+3 , the functions b and h belong to C 2l/+2 , and 
A u p G H 1 . 



6.2. Theorem. If the filtering model is v-regular, in the sense of Definition \6. 1[ for 

some v > d + 1 and 

C p = max sup \p u (x)\ 2 , 

*> l X 
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then 
(6.1) 



K,nt. \\\2 ^ C(v, T) 



ma^EIMV) -p N ' n (ti,-)\\ < K 2(u- d -iy d 
~~f r (l + C p K^ d )A + (K 2 / d + C p K^ d )A 2 (C(l + C p KV d )) N ^A N \ c{1+CpK ^ )T 
+ \ n + (N + l)\ J 

The number C(v,T) depends on u,T, and the parameters of the model (coefficients of the 
equations t \j.l\ j). The number C depends only on the parameters of the model. 

If, in addition, (1 + \x\ 2 )~ w f G L 2 (M. d ) for some w > so that v > d + 1 + w and 

A"((l + |x| 2 )> ) e H 1 , then 
(6.2) 

max E|&.[/] -Mf}\ 2 < C i< ,T,W , ) ul 

0<i<M lYhU S ^'WJI - ^2(u-w-d~l)/d 

" [ Jl + C p K^)A + (K*/* + C p K*/ d )A 2 {C{l + C p K^ d )) N ^A N \ c{1+CpK , /d)T 
+Cf {° n + (ivTT)! ) 6 

The number C(v,T,w) depends on u,T,w, and the parameters of the model; the number C 
depends only on w and the parameters of the model; Cf = J" R d(l + \x\ 2 )~~ 2w \f(x)\ 2 dx. 

Proof. By Theorem |2.1| , 

(6.3) E||p(V)-P*(V)l|g< 



10 — j^2{v-d-l)/d' 

Indeed, by Theorem 4.3.2 in |2"2|| , sup 0<t<T 

E||A"p(t,-)||o < e CT ||A^o||o ; where C depends 
only on v and the parameters of the model. Also, in the notations of Theorem |2.1| , 9 = 1/d, 
q = l/(2d), and 6 1 = {v - l)/d. 

To simplify the further presentation, set k = K x l d and define C K = 1 + C p k. Then, to prove 
( |6.1D , it remains to show that 

t\p K {U)-p K N ' n {U)\ 2 < ( C 9^±jf^L + {G ^} N+ 'f ) e cc « T , 



n (N + l)\ 

and by Theorem |3.3| this inequality holds if, for every vector ( e M. K , 

(6.4) \A K (\ 2 < Ck\\\ \B* Ci 2 < CC K \C\ 2 , \e tAK (\ 2 < e ct \(\ 2 . 

Because of the multi-step approximation, we, as usual, loose one power of A in ( |3.12|) . In- 
equalities ( |6.4| ) are verified by direct calculations using that the operators Ar l C and Ar l l 2 M.i 
are bounded in L2(M. d ). 



To prove (|5~g), let f3(x) = + \x\ 2 and, for weR, define the space L 2iW (R d ) = {/ : f (3 W e 
L 2 (R d )}. Clearly, L 2>w (R d ) is a Hilbert space with inner product (f,g)o, w = (f(3 w , 9f3 w )o and 
norm = (f,f)o,w Then 



(6-5) \<f> u \f\ - <P u [f}\ 2 < CMU, ■) - P T(ti, OIloV 
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Using the calculus of pseudo-differential operators p3], Chapter 4], we conclude that, for 
every g G L 2 , 2 w(^ d ), 

(6.6) 



l#l|o,2tu 



\\P 2w g\\ < C||A-/3 2 -A^|| < C||A^|| . 



Therefore, by Theorem 2.1 and Theorem 4.3.2 in [E2 



(6.7) 

Next, (|6.6j ) implies 



n\ P {t h -)-p K {t 



< 



i; )\\0,2w — ^2{u-w-d-l)/d- 



K 



E\\p"(t h -)-pf n (t 



■ii 7ll0,2u> 



< 



k=l 



Define the diagonal matrix A = (A-ij)ij=i,.. .,k by Ajj = Xf. Then define the matrices 

A K = AA K A-\ Bf = kBfPC 1 , 

and the vectors pit) = Ap K (t), p%{ti) = Ap^ n {ti). With these definitions, the vector p K (t) 
is the solution of 

r 

dp K (t) = A K p K {t)dt + B?p{t)dYi(i) 

i=i 

with the initial condition pjf(O) = \k(.Po, e k)o, the vector p% ,n {t) satisfies 



PN' n (t ) =p K (0), p K /\U) = £ ^a(^PN' n (^l)K 

— v cy' 

~.s- T77. V • 



i = 1, . . . , M, 



and 

(6.8) E|b*(V) -PN' n (ti,-)\\l w < Ct\p K {U)-p K N n {U)\\ 

The functions (p^ satisfy the equations (|5.6|) with A K and Bf instead of A K and B K . 
Direct computations show that, for all ( G 1R K , 



(6.9) 



i*C| 2 <^ 2 |C| 2 , \Bf(\ 2 < CC K \(\ 2 , |e^Cp<e^|C| 



_2|A|2 \nK/-\2 



tA K a 1 2 ^ „Ct\/-\2 



with C depending on w and the parameters of the filtering model. By Theorem [3.3| we then 
conclude that 



np K {ti)-PT\w<[c 



K,n/, \\2 ^ ( ^CftA + K 2 C K A 2 (CC K ) N+1 A N \ CC K T 



n 



(N + V 



Together with ( |6.5| ), fl6.6| ), and ( 6T% ), the last inequality implies 
Theorem |0 is proved. 



□ 
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7. Proof of Theorem |373| 



The proof requires an explicit formula for the solution of (|3.6|) . We begin with some auxiliary 
constructions. 

Every multi-index a with |a| = k can be identified with the set K a = {(i", qf), ■ ■ ■ , q%)} 
so that i" < if < . . . < i\ and if if = if + i, then qj < qf + i- The first pair (if, qf) in K a is 
the position numbers of the first nonzero element of a. The second pair is the same as the 
first if the first nonzero element of a is greater than one; otherwise, the second pair is the 
position numbers of the second nonzero element of a and so on. As a result, if <x- > 0, then 
exactly a| pairs in K a are (j,q)- The set K a will be referred to as the characteristic set 
of the multi-index a. For example, if r = 2 and 

_/0102300...\ 
" ~ ^ 1 2 1 . . . J ' 

then the nonzero elements are a\ = a\ = a\ = 1, a\ = a\ = 2, a\ = 3, and the 
characteristic set is K a = {(l, 2), (2,1), (2,2), (2,2), (4,1), (4,1), (5,1), (5,1), (5,1), (6,2)}. 
In the future, when there is no danger of confusion, the superscript a in i and q will be omitted 
so that (ij, qj) will be written instead of (i", q"). 

Let V k be the permutation group of the set {1, . . . , k}. For a given a G J with \a\ = k and 
the characteristic set {(i 1; qi), . . . , (i k , q k )} define 

E a (s k ]l k ) := ^2 m *i('Mi)) 1 {^(i ) =gi} ' " ■ m i k ( s *(k)) 1 {l„ lk )=<ik}- 

The following notations are introduced to simplify the further presentation: 

• s k , the ordered set (s\, . . . , Sk); ds k := ds\ . . . dsk, 

• l k , the ordered set (Zi, . . . ,l k ); 

• $ t = e At ] 

• F(t; s k ; l k ; g) : = $ t _ afc B Ifc $ -h _ SJk _ 1 . . . B h $ sl _ To g, k > 1; 

r(k,i) i>t i>s k i-s 2 

• / {■■■)ds k := //•••/ (■■■)ds 1 ...ds k ; 

J To J To J To J To 

r 

•E- E ■ 

V> h,...,l k = l 

Note that 

(7.1) \F( t] s k ;l k ;g)\ 2 <C k e c ^- T %\\ f™ ds k = { ^^, V 1 = r k . 

Jto kl V 



7.1. Proposition. If a £ J is a multi-index with \a\ = k and the characteristic set 
q\), . . . , (ifc, qk)}, then, for t 6 [T , t*], the corresponding solution <p a (t] To] U$) of ( ^.dj) 
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is given by 

<p a {t;T ) U ) = 
(M) 

F ^^ sA ^^^o)m iCTW ( Sfe )l fc=gCT(fc)} ---m iCT(1) ( Sl )l {k=gff(1)} d/, fc>l; 

(7.2) CTe pfc ^ o 



'To 

<£> a (t; T ; £7 ) = *t-T ^o, fc = 0, 



and 



(7.3) £ kgafeTo^f = £ I ' gfe . z * ; ^js^fc 
|a|=fc a ' 

Proof. To simplify the notations, the arguments To and Uq will be omitted wherever 
possible. Representation (|7.2|) is obviously true for \a\ = 0. Then the general case |a| > 1 
follows by induction from the variation of parameters formula. 

To prove (|7.3Q , first of all note that 

= Yl m *fc( S ^(fc)) 1 {' CT (fc)='?fc}'" m n( S ^(l)) 1 {' CT (i)=9i}- 
a£V k 

Indeed, every term on the left corresponding to a given <jq G V k coincides with the term on 
the right corresponding to a^ 1 G V k . 

Then ( |7.2| ) can be written as <p a (t) = J2 lk J^ M) F{t; s k ; l k )E a {s k ; l k )ds k . Using the notation 
G(t;s ;l):= $> t _ S(7 ^ k) Bi a ^ k) . . . ^s ct(2) -s ct(1) A CT(1) ^s CT(1) -T fi'ls (T(1) <...<s (T{fc) <t) 

it can be rewritten as 

(7.4) <A*(*) = ^E / G(t;s k ;l k )E a ( S k ;l k )ds k . 

lk [T ,t*] k 

Since for every £ G [T ,t*] the function G(t; s k ; l k ) is symmetric, 

Si ^ 
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with some vector coefficients cp(t). This and ( |7.4| ) imply \(p a (t)\ 2 /a\ = \c a \ 2 /k\ and so 

\^a(t)\ 2 

k\ 

\a\=k 



E 

|a|=fc 



a l 



|«|— n. [T ,t*] k 

ftj / | ^*- s <r(fc)^ CT (fe) • • • ® s <T(2)-s<T{i)Bl (TW ®s a{1) -Tagls tT(1) <...<s tT(k) <t 

lk {T ,t*] k ^ 



£ / \F(t;s k ;l h )\ 2 ds* 



To 



which proves (|7.3|) . Proposition |7.1| is proved. □ 

We continue by considering the truncation only of the length of a. Define Jjy = {a E J : 
\oc\ < iV} and 



(7.5) 



U N {s;T ;U ) = V -^<p a (s; T ; U )UW To ,r)- 
~i v«! 



Note that the summation in (17.51) is still infinite 



7.2. Proposition. In the notations of Theorem [5^ 
(7.6) sup E\U(s;T ;U ) - U N \s;T 0] U Q )\* < ^f'^' e ^ E\Uo\ 2 - 

Proof. To simplify the presentation, the arguments To and Uq will be omitted wherever 
possible. 

By Theorem pM| , 

|^( S )|2 _ HM 



(7.7) E^T^ = E/ \F{s;W)\W. 



\a\=k 



T 



Since the random variables £ a (WT ,t) are uncorrelated and are independent of Uq, formulas 

\ E|y a (, 
✓ |a|=fc a\ 



O and O imply E\U(s) - U N (s)\ 2 = £ fc>iV E M =k ^f^- By 0, 



^ ^ Elv^)! 2 ^ eCi(s _ To) e|[/q|2 ^ (C 2 r(s-T )) k 

k>N \a\=k ° L ' k>N ^' 

which completes the proof of Proposition [7.2| . □ 
Now we truncate the sum in ( |7.5| ) even more by restricting a to the set J^. 
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7.3. Proposition. In the notations of Theorem PO and Proposition [7~§, 



(7.8) 



E\U N (t*;T ;U )-U^(t*;T Q ;U )\ 2 < 2C 2 r e G ^- T °He(B)^^ 

\ n 



o) 2 



C (1 + (t* - T )C 2 r) {t * r ° )3 )E]^o| 2 - 

n / 



Proof. To simplify the presentation, the arguments T and Uq will be omitted wherever 
possible. 

If a is a multi-index with |a| = k and the characteristic set {(«i,?f) • • • , then 
i% = d(a), the order of a, and so the set J 1 ^ can be described as {a G J : |a| < AT; i?, < n}. 
Since the random variables £ Q are uncorrelated and are independent of Uq, 



oo N ml /lJ|t v|2 



E|t^(o-^(oi 9 = X) E E E| ^ (r)l 



a! 

fe=n+l k=l \a\=k;i%=b 



oo V , /,*\|2 



\<P a {t*)\' 



The problem is thus to estimate 

b=n+l fe=l | a |=fc;i£=& ' ' ' 

By Theorem |7.1| the corresponding solution (p a of fl3.6| ) can be written as 

.(fc,t*) 



(7.9) p a (t*) = V / ; s fc ; Z fc )£ a (s fe , Z fc )^ fc 



According to (|3.7p, the characteristic set of «(«&, (&) is {(zi, gi), . . . , (ifc-i, <?fc-i)}; 
therefore, it is possible to write 

k 

E a {s k ) =Y,m k {s j )l { i J=qk} E a{ik>qk) {s k j] l';), 
i=i 

where (resp. denotes the same set (si,...,Sk) (resp. (Zi, . with omitted Sj 

(resp. /j); for example, = (s 2 , . . . , Sfc). 

As a result, after changing the order of integration in the multiple integral, equality ( |7.9[) 
can be rewritten as 

(7.io) <^(**)=EE / ( / SJ+lF (^; s ^ z 1^( s i) 1 ft=^^)^fe,^(4;?) ds ^ 

i=l {* ^ ■ /s i" 1 

where s := T ; s k+ i := t*. 
Denote 

y/2(t* - T ) . / 7r(fc — !)(£ — Tp) \ 

Mfc(s) := — — -; : — sin ; — : k > 1, T < s < t , 

v ; tt(/c — 1) V (t* -T ) /' _ _ ' 
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dF(t*; s k ; l k ) 

and Fj : = — - . Then, as long as ik — b > 1, integration by parts in the inner 

integral on the right hand side of (|7.10| ) yields: 

3+1 F(t*;s k ;l k )m b (s j )ds j 



i-i 



Sj=S j + 1 



s j+i 



F{t*; s k ; l k )M b ( Sj ) J - I F^t*; s k ; l k )M b ( Sj )d Sj . 

For each j, let us rename the remaining variables s k in (|7.10|) as follows: ti := Si,i < 
j — 1; ti := s i+ i,i > j — 1, or, symbolically, t k_1 := s k . We will set t := T , t k := t* and 
denote by t = 1, . . . , k — 1, the set t k ~ 1 in which tj is repeated twice (e.g. t 1,1 = 

(ti, t\, . . . , tk-i), etc.); also t fe_1 '° := (t 0j ti, t-z, ■ ■ ■ , tfc-i), t fc ~ 1,fc := (t\, . . . , t^-i, 

The similar changes will also be made with the set l k : for fixed j, there are k — 1 free indices 
l\, . . . , lj-i, lj + i, ...,1k and they are renamed just like s k to form the set l k_1 (in this case, the 
same symbols are used). Similarly, Z fe_lj denotes the set (/i, . . . , lj-i, qk, lj, ■ ■ ■ , h-i)- After 
these transformations, E a ^ kqk ^(s k ;l k ) becomes E a ^ km ^(t k ^ 1 ; Z fc_1 ) - independent of j, and 

F(t*;s k ;l k )l {lj=qk} M b ( Sj 

= F(t*; t k -^; l k -^)M b {tl)-F(t*; t k ~^~ l - l^M^), j = 1, . . . , k. 
Therefore, if d(a) = b > 1 and \a\ = k > 0, then 

(fc-i,t*) 



,(o = E / (/iV^-v*- 1 ) 

^ To 

+ /& 2) (^*; * fe_1 ; ^ -1 ))- £ 'a(t fc ,g fc )(* fc-1 ; i k ~ l )dt h 



where 



- F^jt*- 1 ^- 1 ;/*-^)^/^^!)) iffc>l, 



= if = 1 - because M b (t ) = M b (t k ) = (this is the only place where the choice of 
{mk} really makes the difference), and 

/fCt*;**" 1 ;!*- 1 ) = - F 1 (r;s,t k - 1 ;q k ,l k - 1 )M b (s)ds 



'To 



£ / J F,-(t*; . . . , £j-_i, s, ^, . . . ; l k ~ 1,j )M b (s)ds 

-1=9 



j=2 " <"J- 

" F k (t*- } t k -\ S ;l k -\q k )M b (s)d S . 

'tk-l 
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Note that if the operators B\ commute with each other, then / 6 (t*; t k 1 ; l k 1 ) is identically 
equal to zero for all k. 

Since |a(z| a |, q\ a \)\ — \a\ — 1 and a\ > a{i\ a \,q\ a \)\, it now follows from ( |7. 10|) that 



E 



\f>a{f)? 



\a\=k;i%=b 

- E E |4t E / ,< " M ' , (/i 1) + f?)E aM d^ 2 



|a|=fc;i«=6 9fc =l V "- /fe-i JT 

<E E I^Ef^V + zrW*" 2 

4 ' V P ! ; fe -l>/T 



9fe=l |/3|=fc- 



and the proof of Proposition [7J] shows that the last expression is equal to 



(7.11) 



g ft =i/fe-i JT ° 



(k-l,t*) 



fl l \e ] t k -\i k - 1 ) + fl 2 \r ] t k ' l -i k - 1 ) 



dt 



k-1 



Definition of / b implies 



(7.i2) i/ b {i) r = o, fc=i ; i/ri 2 < 



fc(C 2 ) fc e(g)(^-T )^ i(t ,_ 



(6-1)= 



iC/oh A;>2. 



Next, direct computations yield 



F j {t*;s k ;l k ) 



so that by assumption (3) of the theorem, \Fj(t*; s k ; l k )\ 2 < C (C 2 ) k e Cl ^- T ^ \U \ 2 . 
After that the definition of k implies: 



l/fi 2 < AC k(C 2 ) k e c ^- T ^\U \ 2 (t*-T ) / (M b (s)) 2 ds 

k * 3 ° 

< Cofc(C 2 ) (t* - Tp) c C l( f*- To ) |2 . 



(6-1)= 
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(*-!,**) 



so, since 



eft 



fc-i 



(t* - To) k - l /(k - 1)!, (frllD , (TO) and the last inequality yield 



T 



N 



E^(o-^(oi 2 = EE E 

6>n+l fe=l | a ]=fc;i£=& 

fc + 2 (C 2 r(r - T )) fc 



<C7 2 re^-^[ C (S)(f-2?)E^y- 



/c>0 



fc! 



+C (t*-T ) 3 E 



(k + l)(C 2 r(t*-T )Y 



k>0 



k\ 



< 



2CV e c (**- T o) 



[e(£)(t* - T ) 2 + (1 + (t* - T )C 2 r)C (t* - T ) 3 ] E\U \ 2 . 



This completes the proof of Proposition |7.3| . The statement of Theorem |3.3| now follows from 
Propositions [T2] and [7I5| . 
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